Korpus: vec_wikipedia_2012

Weitere Korpora

2.2.12 Typical Prefixes and Suffixes

Typical prefixes and suffixes of words of length 1 and 2 using Levenshtein

Common Prefix
Prefix Count Percent
L'- 708 0.2784
L’- 304 0.1195
ł'- 285 0.1121
ri- 227 0.0893
de- 206 0.0810
a- 206 0.0810
s- 165 0.0649
in- 141 0.0554
1- 98 0.0385
2- 92 0.0362
I- 91 0.0358
Re- 86 0.0338
co- 82 0.0322
n'- 82 0.0322
di- 73 0.0287
d'- 67 0.0263
e- 67 0.0263
d- 51 0.0201
p- 42 0.0165
da- 39 0.0153
Common Suffix
Suffix Count Percent
-e 946 0.3720
-i 907 0.3566
-a 542 0.2131
-o 484 0.1903
-r 436 0.1714
-da 234 0.0920
-va 210 0.0826
-re 177 0.0696
-se 174 0.0684
-tà 169 0.0665
-n 91 0.0358
-l 82 0.0322
-s 80 0.0315
-łe 77 0.0303
-lo 69 0.0271
-li 66 0.0260
-al 64 0.0252
-na 63 0.0248
-ni 62 0.0244
-łi 62 0.0244
Ratio of Suffixes/Prefixes
Ratio
1.6183
Word Pairs with 2>=Levenshtein among top words
Top words Count
100 10
300 52
1000 460
3000 3556
10000 26196
30000 147672
44105 254316


Word Pairs with 2>=Levenshtein among top words


Gnuplot diagram

3558 msec needed at 2018-01-27 05:20